3574 results found.
Speech/Written
Phoneme Similarity Matrices,
Language Type:
Multilingual
Languages:
English Spanish
Availability:
From Owner
License:
Please contact us
Size:
29x29 phonemes (spa) , 39x39 phonemes (eng) OtherProduction Status:
Newly created-finished
Use:
Automatic Subtitling
-
Paper title:Phoneme Similarity Matrices to Improve Long Audio Alignment for Automatic Subtitling
-
Paper track:Speech
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Pablo Ruiz | Vicomtech-IK4 | ES | LATTICE Lab, ENS | FR |
| Author 2 | Aitor Álvarez | Vicomtech-IK4 | ES | ||
| Author 3 | Haritz Arzelus | Vicomtech-IK4 | ES | ||
| Main Contact | Pablo Ruiz | LATTICE Lab, ENS | None | LINHD, UNED | None |
Documentation:
Publicly available at the same URL, in English.
Written
Ontology,
Language Type:
Trilingual
Languages:
English Spanish french
Availability:
Freely Available
License:
Creative Commons Non Profit
Size:
2 GByte Production Status:
Newly created-finished
Use:
Semantic Web
-
Paper title:A disambiguation resource extracted from Wikipedia for semantic annotation
-
Paper track:Terminology
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Eric Charton | <Not Specified> | None | ||
| Author 2 | Michel Gagnon | <Not Specified> | None | École Polytechnique de Montréal | None |
| Main Contact | Eric Charton | École Polytechnique de Montréal | CA |
Documentation:
On websiteLanguage Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
LDC
Size:
42 GByteProduction Status:
Existing-used
Use:
Web Services
-
Paper title:Improving Cloze Test Performance of Language Learners Using Web N-Grams
-
Paper track:Applications
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Martin Potthast | Bauhaus-Universität Weimar | None | ||
| Author 2 | Matthias Hagen | Bauhaus-Universität Weimar | DE | ||
| Author 3 | Anna Beyer | Bauhaus-Universität Weimar | None | ||
| Author 4 | Benno Stein | <Not Specified> | None | Bauhaus-Universität Weimar | None |
| Main Contact | Matthias Hagen | Bauhaus-Universität Weimar | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
697 KByte Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:Annotating Characters in Literary Corpora: A Scheme, the CHARLES Tool, and an Annotated Novel
-
Paper track:Written
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Hardik Vala | McGill University | CA |
| Author 2 | Stefan Dimitrov | McGill University | CA |
| Author 3 | David Jurgens | Stanford University | US |
| Author 4 | Andrew Piper | McGill University | CA |
| Author 5 | Derek Ruths | McGill University | CA |
| Main Contact | Hardik Vala | McGill University | None |
Documentation:
<Not Specified>
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
BNC User Licence
Size:
100000000 words Production Status:
Existing-used
Use:
Language Modelling
-
Paper title:Portable Spelling Corrector for a Less-Resourced Language: Amharic
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Andargachew Mekonnen Gezmu | Otto-von-Guericke-Universität Magdeburg | DE |
| Author 2 | Andreas Nürnberger | Otto-von-Guericke-Universität Magdeburg | DE |
| Author 3 | Binyam Ephrem Seyoum | Addis Ababa University | ET |
| Main Contact | Andargachew Mekonnen Gezmu | Otto-von-Guericke-Universität Magdeburg | None |
Documentation:
Reference Guide for the British National Corpus (XML Edition) edited by Lou BurnardLanguage Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Document Classification, Text categorisation
-
Paper title:Sublanguage Corpus Analysis Toolkit: A tool for assessing the representativeness and sublanguage characteristics of corpora
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Irina Temnikova | Qatar Computing Research Institute | BG | Qatar Computing Research Institute, HBKU | QA |
| Author 2 | William A. Baumgartner Jr. | University of Colorado School of Medicine | US | U. Colorado School of Medicine | US |
| Author 3 | Negacy D. Hailu | University of Colorado School of Medicine | US | ||
| Author 4 | Ivelina Nikolova | Bulgarian Academy of Sciences | BG | ||
| Author 5 | Tony McEnery | Lancaster University | GB | ||
| Author 6 | Adam Kilgarriff | Lexical Computing Ltd. | GB | ||
| Author 7 | Galia Angelova | Bulgarian Academy of Sciences | BG | ||
| Author 8 | K. Bretonnel Cohen | University of Colorado School of Medicine | US | ||
| Main Contact | Irina Temnikova | Qatar Computing Research Institute, HBKU | None | Sofia University | None |
Documentation:
<Not Specified>Language Type:
Trilingual
Languages:
English Farsi Russian
Availability:
From Owner
License:
<Not Specified>
Size:
397948 Production Status:
Existing-updated
Use:
Automatic metaphor recognition
-
Paper title:Automatic Expansion of the MRC Psycholinguistic Database Imageability Ratings
-
Paper track:Terminology
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||||
|---|---|---|---|---|---|---|---|
| Author 1 | Ting Liu | ILS, University at Albany | US | ||||
| Author 10 | Nick Webb | Union College | US | ||||
| Author 11 | Umit Boz | University at Albany | US | ||||
| Author 12 | Ignacio Cases | University at Albany | US | ||||
| Author 13 | Ching-Sheng Lin | ILS Institute, University at Albany, SUNY | US | ||||
| Author 2 | Kit Cho | University of Houston-Downtown | None | SUNY - University at Albany | US | University at Albany | US |
| Author 3 | G. Aaron Broadwell | University at Albany | US | ||||
| Author 4 | Samira Shaikh | University at Albany | None | University at Albany | US | ||
| Author 5 | Tomek Strzalkowski | University at Albany | US | ||||
| Author 6 | John Lien | University at Albany | US | ||||
| Author 7 | Sarah Taylor | Sarah M. Taylor Consulting, LLC | US | ||||
| Author 8 | Laurie Feldman | University at Albany | US | ||||
| Author 9 | Boris Yamrom | University at Albany | US | ||||
| Main Contact | Ting Liu | ILS, University at Albany | None | Siena College | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons
Size:
500000 quotes OtherProduction Status:
Newly created-finished
Use:
Numerous: sentiment, NE extraction, IR etc
-
Paper title:The Minho Quotation Resource
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Author 1 | Brett Drury | <Not Specified> | None | LIAAD-INESC | None | ||||
| Author 2 | Jose Joao Almeida | <Not Specified> | None | University of Minho | None | University of Minho | PT | Universidade do Minho | None |
| Main Contact | Brett Drury | LIAAD-INESC | PT |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Owner
License:
<Not Specified>
Size:
600 tweets OtherProduction Status:
Newly created-finished
Use:
Opinion Mining/Sentiment Analysis
-
Paper title:Challenges of Evaluating Sentiment Analysis Tools on Social Media
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Diana Maynard | University of Sheffield | GB |
| Author 2 | Kalina Bontcheva | University of Sheffield | GB |
| Main Contact | Diana Maynard | University of Sheffield | None |
Documentation:
<Not Specified>
Written
,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC-BY
Size:
353 entries Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Word2Sense: Sparse Interpretable Word Embeddings
-
Paper track:Long/Word-level Semantics
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Abhishek Panigrahi | WordSim - 353 | /N |
Documentation:
Publicly available




